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Abstract — Erasure coding techniques are used to increase the 
reliability of distributed storage systems while minimizing stor- 
age overhead. Also of interest is minimization of the bandwidth 
required to repair the system following a node failure. In a 
recent paper, Wu et al. characterize the tradeoff between the 
repair bandwidth and the amount of data stored per node. They 
also prove the existence of regenerating codes that achieve this 
tradeoff. 

In this paper, we introduce Exact Regenerating Codes, which 
are regenerating codes possessing the additional property of 
being able to duplicate the data stored at a failed node. Such 
codes require low processing and communication overheads, 
making the system practical and easy to maintain. Explicit 
construction of exact regenerating codes is provided for the 
minimum bandwidth point on the storage-repair bandwidth 
tradeoff, relevant to distributed-mail-server applications. A sub- 
space based approach is provided and shown to yield necessary 
and sufficient conditions on a linear code to possess the exact 
regeneration property as well as prove the uniqueness of our 
construction. 

Also included in the paper, is an explicit construction of re- 
generating codes for the minimum storage point for parameters 
relevant to storage in peer-to-peer systems. This construction 
supports a variable number of nodes and can handle multiple, 
simultaneous node failures. All constructions given in the paper 
are of low complexity, requiring low field size in particular. 

I. Introduction 

Reliability is a major concern in large distributed storage 
systems where data is stored across multiple unreliable stor- 
age nodes. It is well known that adding redundancy increases 
the reliability of the system but at the cost of increased 
storage. Erasure coding based techniques [1], [2] (eg. using 
Maximum distance separable(MDS) codes) have been used 
to minimize this storage overhead. 

In a distributed storage system, when a subset of the 
nodes fail, the system needs to repair itself using the existing 
nodes. In erasure coding based systems, each node stores a 
fragment of an MDS code. Upon failure of a node, the failed 
fragment can be restored back using the existing fragments. 
The amount of data that needs to be downloaded to restore the 
system after a node failure is one of the significant parameters 
of a distributed storage system. In [3] the authors introduce a 



new scheme called Regenerating Codes which store a larger 
amount of data at each node compared to an MDS code, 
in order to reduce the repair bandwidth. In [4] the authors 
establish a tradeoff between the amount of storage required at 
each node and the repair bandwidth. Two extreme and practi- 
cally relevant points on this storage-repair bandwidth tradeoff 
curve are the minimum bandwidth regeneration(MBR) point 
which represents the operating point with least possible repair 
bandwidth, and the minimum storage regeneration(MSR) 
point which corresponds to the least possible amount of 
data stored at a node. By an optimal Regenerating Code, we 
will mean a Regenerating Code that meets the storage-repair 
bandwidth tradeoff. 

A principal concern in the practical implementation of 
distributed storage codes is computational complexity. A 
practical study of the same has been carried out in [7] for 
random linear regenerating codes. Although the existence 
of optimal regenerating codes was proved in [4], for code 
construction, the authors have suggested the use of a general 
network-coding-based code construction algorithm due to 
Jaggi et al [5]. The drawbacks of such an approach include 
high complexity of code construction as well as the require- 
ment of a large field size. 

In this paper, we introduce Exact Regenerating Codes, 
which are regenerating codes possessing the additional prop- 
erty of being able to regenerate back the same node upon 
failure. We give a low-field-size, low-complexity, explicit 
construction for exact regenerating codes at the MBR point. 
Using the subspace based approach provided, we also prove 
that our code is unique among all the linear codes for this 
point. Explicit construction is also given for regenerating 
codes at the MSR point for suitable parameters which can 
handle multiple node failures. To the best of our knowledge, 
our codes are the first explicit constructions of optimal 
regenerating codes. 

In [6], Wu et al. also independently introduce the notion of 
exact regeneration(termed exact repair in [6]) for the MSR 
point. However, the codes introduced in their work do not 
meet the storage-repair bandwidth tradeoff. The construction 



proposed by them is of high complexity and also has the 
disadvantage of a large field size requirement. 

The rest of the paper is organized as follows. In Section 
[il] we introduce the notion of Exact Regenerating Codes. 
Explicit construction of Exact Regenerating Codes for the 



MBR point is given in Section III The complexity and the 
field size requirement of the proposed code construction algo- 



rithm are also analyzed here. In Section IV a subspace based 
approach to construction of these codes is provided which 
is later used to prove the uniqueness of our construction. 
Section [V] provides a construction of regenerating codes for 
the MSR point for certain practically relevant parameters. 



Finally, conclusions are drawn in Section VI 



II. Exact Regenerating Codes 




stores a symbols 

Fig. 1. An illustration of exact regeneration: On failure of node 5, data 
from nodes 1 to 4 is used to regenerate back the same data that node 5 
earlier had. 



The system description is as follows. All data elements 
belong to a finite field ¥ q of size q. The total size of the 
file is B units. The data is stored across n storage nodes 
in a distributed manner where each node can store up to a 
units of data. A data collector(DC) connects to any k out of 
these n nodes to reconstruct the entire data. This property 
is termed as 'reconstruction property'. The data collector is 
assumed to have infinite capacity links so that it downloads 
all the data stored in these k nodes. 

When a node fails, a new node is added in its place by 
downloading (3 units of data from any d(> fcQ out of the 
remaining n — 1 nodes. In regenerating codes as introduced 
in [4], the new node formed need not be identical to the 
failed one. It should satisfy the reconstruction property along 
with the existing nodes. This property wherein a new node 
satisfying reconstruction can be created as a replacement for 
a failed node is termed as 'regeneration property' . Any other 
node subsequently regenerated using this node should satisfy 
both the properties. Hence the new node along with all other 
nodes should satisfy these properties for a possibly infinite 
sequence of failures and regenerations. 



We introduce a desirable property into regenerating codes 
wherein the regenerated node is identical to the one which 
failed. We will call regenerating codes having this additional 
property as 'Exact Regenerating Codes'. Fig. [T] shows an 
example of the this scheme. As a failed node is replaced by an 
identical node, Exact Regenerating Codes have to satisfy the 
reconstruction property at only one level. Also, the additional 
communication and processing overheads required to update 
all the other nodes and data collectors about the new node is 
completely avoided. This makes the storage system practical 
and easy to maintain. 

III. Exact Regenerating codes for the MBR point 

The MBR point is the fastest recovery point (on the 
storage-repair bandwidth tradeoff curve) in terms of the data 
to be downloaded for regeneration per unit of the source data. 
Also, among all the possible values of d, d = n— 1 point gives 
the fastest recovery as all the existing nodes simultaneously 
help in the regeneration of the failed node. Hence the MBR 
point with d = n — 1 is highly suitable for applications such 
as distributed mail servers, where it is crucial to restore the 
system in the shortest possible time. 

This section gives the construction of linear exact regen- 
erating codes at the MBR point for d = n — 1 and any k. 
At the MBR point, optimal a and (3 on the storage-repair 
bandwidth tradeoff curve are given by (from [4]): 



(otMBR, PmBR.) — 



2Bd 



2B 



(1) 



2kd-k 2 + k'2kd-k 2 + k 

Clearly for a feasible system we need (3 to be an integeiQ 
Assume (3 to be the smallest possible positive integer, i.e. 
(3 = 1. Then we have 

k(k - 1) 



B = kd 



and 



= d 



(2) 



(3) 



For any larger file size, the source file is split into chunks 
of size B, each of which can be separately solved using the 
construction for (3=1, Reconstruction and regeneration will 
be performed separately on these smaller chunks and hence 
additional processing and storage required to perform these 
operations is greatly reduced. 

A. Code construction 

Denote the source symbols of the file by / = 

(jo fx h fs-if- Let d = n - 1 and 9 = &±&. 
Let V be a n x 8 matrix with the following properties: 

1) Each element is either or 1. 

2) Each row has exactly d l's. 

3) Each column has exactly two l's. 



'From [4], if d < k, the mincut condition will require data reconstruction 
to hold for d nodes, hence k can be set as d. 



2 It can be seen from equation |TJ that if /3 is an integer, then a and B 
are also integers. 




Fig. 2. Fully connected undirected graph with 5 vertices. Vertices represent 
nodes and edges represent vectors corresponding to the common symbol 
between two nodes. 

4) Any two rows have exactly one intersection of l's. 

It is easy to see that V is the incidence matrix of a fully 
connected undirected graph with n vertices. Our construction 
of exact regenerating codes for the MBR point uses the 
above described matrix V. Consider a set of 9 vectors 
{v lt v 2 , . . . ,v e } which form a S-dimensional MDS code. 
The vectors (i — 1 , . . . , 8) are of length B with the 
constituent elements taken from the field ¥ q . Node j stores 
the symbol /*t^ if and only if V(j, i) = 1. Thus in the graph 
corresponding to V, vertices represent the nodes, and edges 
represent the vectors corresponding to the symbols stored. 
Thus, by the properties of the matrix V, we get n nodes 
each storing d(= a) symbols. Properties 3 and 4 ensure that 
each row intersects every other row in distinct columns. The 
validity of this code as a exact regenerating code for the 
MBR point is shown below. 

Data Reconstruction: The DC connects to any k out of 
the n storage nodes and downloads all the ka symbols 
stored. As any two rows of the matrix V intersect in only 
one column and any row intersects all other rows in distinct 
columns, out of the ka symbols downloaded, exactly (*) 
symbols are repetitions and do not add any value. Hence the 
DC has ka — (z) = B distinct symbols of a £?-dimensional 
MDS code, using which the values of the source symbols 
/o> • • • > Jb-x can be easily obtained. 

Exact Regeneration: The matrix V provides a special 
structure to the code which helps in exact regeneration. 
Properties 3 and 4 of the matrix V imply that the each 
of the existing n — 1 nodes contain one distinct symbol 
of the failed node. Thus exact regeneration of the failed 
node is possible by downloading one symbol each from the 
remaining n — 1 nodes. 

In section [IV] it will be proved that this code construction 
scheme is unique for linear exact regenerating codes up to 
the choice of vectors v 2 , ... , v e }. In the above 

description we have chosen these set of vectors to form a 
£?-dimensional MDS code. In fact, it suffices if the vectors 
are chosen such that, for any set of k nodes, the constituent 
vectors are linearly independent. 



B. Example 

Let n = 5, k = 3. We get d = n - 1 = 4 and 8 = 10. 
Putting (3=1 gives a — 4 and B = 9. As described in the 
previous section, the matrix V is the incidence matrix of a 
fully connected undirected graph with 5 vertices (Fig. [2J as 
given below: 
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Thus the 5 nodes store the following symbols: 

Node 1: /V 2 , 1%, 1%} 

Node 2: {£%, f%, fv 6 , fv 7 } 

Node 3: {fv 2 , fv 5 , fv s , fv 9 } 

Node 4: {fv 3l fv 6 , fv 8 , fv w } 

Node 5: {fv 4 , fv 7 , fv 9 , fv w } 

Reconstruction: Suppose the data collector connects to 
nodes 1, 2 and 3. It can retrieve the symbols / v_ x ,. . . ^Vg, 
and using these, recover the source symbols fo> Th e 
same holds for any choice of 3 nodes. 

Regeneration: Suppose node 3 fails. Then, node 1 gives 
/*u 2 , node 2 gives / « 5 , node 4 gives f Vg and node 5 gives 
f v 9 . All these four symbols are stored as the new node 3. 
Thus the regenerated node 3 stores exactly the same symbols 
as the failed node. 

In this example, 8 = B + 1 and hence we can take the 
vectors {v lt . . . ,v 10 } to form a single parity check code of 
dimension 9. So the exact regenerating code for this set of 
parameters can be obtained in F 2 . 

C. Field size required 

The required field size is the minimum field size required 
to construct a [8, B] MDS code. If we use a Reed-Solomon 
code, the minimum field size required for our construction 
turns out to be 8(= n(n— 1)/2). In [4] authors have suggested 
to cast the problem of constructing deterministic regenerat- 
ing codes as a virtual multicast network code construction 
problem and then use the algorithm due to Jaggi et al. [5] to 
determine the network coefficients. This algorithm requires 
field size of the order of number of sinks, which in this 
case leads to a very high field size. In fact, the problem of 
exact regenerating code construction leads to a non-multicast 
network code problem for which there are very few results 
available [8], [9]. 

D. Complexity 

Code construction: Code construction is immediate given 
the incidence matrix V of a fully connected graph with n 
vertices. No arithmetic operations are required. 

Node Regeneration: The method used for regeneration 
does not require the existing nodes to perform any additional 



operations. Each existing node just has to pass one symbol 
to the new node from the set of a symbols stored in it. 

If the regeneration is not exact, additional communication 
to the nodes and data collectors about changes in the code co- 
efficients is necessary. Also, all the nodes need to recalculate 
the vectors which they have to pass for subsequent regener- 
ations. In the case of exact regeneration, these overheads are 
avoided. 

Data Reconstruction: To facilitate the DC to easily decode 
the downloaded data, one set of k nodes can be made 
systematic, i.e. these k nodes will store the source symbols 
without any encoding. This can be achieved by performing a 
change of basis on the S-dimensional vector space spanned 
by the vectors {t^, v 2 i ■ • ■ , =$}, so tnat tne desired k nodes 
have entire data in uncoded form. Hence, if the DC preferably 
connects to this set of k nodes, no decoding is necessary. 

If regeneration is not exact, the systematic property cannot 
be maintained. When any of one of the k nodes chosen to 
be systematic fails, the regenerated node may not be in the 
systematic form and hence the property will be lost. 

IV. SUBSPACE VIEWPOINT AND UNIQUENESS 

In the construction of exact regenerating codes given in 
nodes were viewed to be storing a symbols 



III 



section 

each from a finite field. In this section, we provide an 
alternative viewpoint based on subspaces which completely 
characterizes linear exact regenerating codes for the MBR 
point for any values of (n, k, d). By a linear storage code, 
we mean that any symbol stored is a linear combination of 
the source symbols, and only linear operations are allowed 
on them. 

The subspace viewpoint will be used to prove the necessary 
and sufficient conditions for a linear storage code to be an ex- 
act regenerating code for the MBR point. This subsequently 
leads to the uniqueness of our construction. 

Define a vector / of length B consisting of the source 



symbols (as in section IIIi. Since each source symbol can 
independently take values from F g , the B source symbols 
can be thought of as forming a £>-dimensional vector space 
over ¥ q . 

Since the code is linear, any stored symbol can be written 
as /*| for some vector I. These vectors which specify the 
linear combinations define the code, and the actual symbols 
stored depend on the instantiation of /. Since a node stores a 
symbols, it can be considered as storing a vectors of the code, 
i.e. node i stores the vectors £± , . . . ,1^- Linear operations 
performed on the stored symbols are equivalent to the same 
operations performed on these vectors. Hence we say that 
each node stores a subspace of dimension at most a i.e. 



node i: Wi 



(i) 



f(i) 



. n 



where Wi denotes the subspace stored in node i , i = 1 
and (.) indicates the span of vectors. 

For regeneration of a failed node, d other nodes provide (3 
symbols each. We say that each node passes a subspace of 



dimension at most j3. 

Consider the exact regeneration of some node i using any 
d out of the remaining n — 1 nodes. Denote this set of d 
nodes by D, and let j £ D. Let s £, denote the subspace 
passed by node j for the regeneration of node i. 

In the following lemmas, we prove certain subsapce prop- 
erties associated with linear exact regenerating codes at the 
MBR point. 

Lemma 1: For any (n, k, d) linear exact regenerating 
code for the MBR point, each node stores an a-dimensional 
subspace, i.e. 

dim{Wi} = a, Vi 6 {l,...,n}. 

Proof: Consider data reconstruction by a DC connecting 
to any k nodes, Ai,. . . ,Afc. Let these k nodes store subspaces 
with dimensions fix,. ..,fifc respectively. As each node can 
store a subspace of dimesion at most a, 



Qi < a, Vi £ {!,..., k} 



(4) 



For the DC to be able to reconstruct all the data, the 
dimension of the sum space of these k subspaces should be 
B, i.e. 



dim{W Al + Wa 2 



W Ak } = B 



(5) 



Using the expression for the dimension of sum of two 
subspaces recursively we get, 

dim{W Al H h W Ak } 

= dim{W Al } + dim{W A2 } - dim{W Al n W Aa } 
+dim{W As } - dim{W A3 n {W Al + W A2 }} 

+dim{W Ak } - dim{W Ak n {W Al + ■■■ + ^ k _J} 

k 

= Y^dim{W Ai } 
i=i 

~dim{W Ak n {w Ak _, +... + w Al }} 

dim{W A3 n{W A2 +W Al }} 

-dim{W A2 DW Al } (6) 



< 



-{Q k -(d-{k-l))L3)+ 

(n 3 -(d-2)/3)+ 

-(0 2 -(d-l)/3) + 

k 
1=2 



(7) 
(8) 



= fl 1 + (k-l)df3-{k-l + --- + 2+ 1}(3 (9) 
= fl 1 -a + B (10) 
<B (11) 

In (x) + stands for max(x, 0). The justification for ^ 



is as follows. Suppose nodes A l7 . . . , Aj_i and some Othel- 
lo d — (I — 1)) nodes participate in the regeneration of node 
A;. The maximum number of linearly independent vectors 
that the (d — (I — 1)) nodes (other than Ai,. .. , A;_i) can 
contribute is (d — (I — l))/3- If this quantity is less than Hi 
then the I — 1 nodes under consideration will have to pass the 
remaining dimensions to node I. Hence for any I = 2, . . . , k 

■ + W Al }} 

> (Oj-(d-(Z-l))/3)+ (12) 

Equation ^ follows by the property that any two non- 
negative numbers yi and y^ satisfy the inequality (yi — (yi — 



dim{W Al n {W Al _, 



i/2) + ) < Vi - Equation ( jToj t follows from (TTjiand equation ( 1 1 
from Q. Now, for equation |5]l to hold, ^1 1| > should be satis- 
fied with equality, which forces = a. Similarly, expanding 
with respect to other nodes, and considering different sets of 
k nodes, we get dim{Wi} = a, Vi S {1, . . . , n}. ■ 
Corollary 2: Let D m be any subset of D of size to, where 
to < k. For any (n, k, d) linear exact regenerating code at 
the MBR point, 



dim { \Y, M { W i 



— mf3 



Proof: Putting il/ = a — d(3 in ( [L?) we get, 
dim(W Ai n{W Al _ 1 +--- + W Al }) > (1-1)13 (13) 
Using ^ and ([13), 

dim {W Al +■■■ + W Ak } 

k 

= ^dim{W Ai } 

i=l 

-dim{W Ak n {W^ +■■■ + W Al }} 

dim{W A3 n{W A2 + W Al }} 

-dim{W A2 nW Al } (14) 

< ka-{k-l-\ h2+ l}/3 (15) 

= B (16) 

For equation |5| to hol d, ([T5] > should be satisfied with 
equality. This along with ( 13 1 gives the result. ■ 
Note that putting m = 1 gives 



dim{Wi^W 3 } = 13 



(17) 



Lemma 3: For any (n, k, d) linear exact regenerating 
code at the MBR point, 



Wi n Wj 



Also, the subspaces Wi D Wj are linearly independent Vj 6 
D. 

Proof: Consider the exact regeneration property of node 
i. As d(3 — a, node i must store all the information passed 
by the nodes in D. Hence, the subspace passed by node j 



must be a subspace of Wi as well, i.e. 

sjj, c {Wi n Wj) 

Also, 



E dim{s^%} 
je{D} 



> dim 




dim {Wi] 

d/3 

(i) 



(18) 



(19) 

(20) 
(21) 



which along with the fact that dim{S] J£)} < (3 implies that 
equation ( [T9| should be satisfied with equality and 

dim{Sf D } = (3 
From equations ([17), ( fT8] l and ( |22) , it follows that 



(22) 



Wi n w, 



(23) 



19 1 implies that the subspaces £, are 



Equality of equation 
linearly independent Vj" € D 

Hence for any linear exact regenerating code for the MBR 
point, each node should store an a dimensional subspace, 
and the intersection subspaces of a node with any d other 
nodes should have dimension (3 each and should be linearly 
independent. 

The following theorems prove the uniqueness of our code 
for the MBR point. 

Theorem 4: Any linear exact regenerating code for the 
MBR point with d = n — 1 should have the same subspace 
properties as our code and hence the same structure as our 
code. 

Proof: Let C be an exact regenerating code obtained via 
our construction. Let C be another optimal exact regenerat- 
ing code for the MBR point which satisfies the reconstruction 
and exact regeneration properties. Let W[,...,W' n be the 
subspaces stored in nodes 1, . . . ,n respectively in code C. 
Apply Lemma [3] to node 1 in C and let s' 2 , . . . , s' n be the 
/3-dimensional intersection subspaces of node 1 with nodes 
2, . . . , n respectively. As s' 2 , . . . ,s' n are linearly independent 
subspaces spanning a dimensions, they constitute a basis 
for W{ and hence can be replaced as the contents of node 
1. Now consider node 2. One of the intersection subspaces 
will be s' 2 (with node 1). Let S3, . . . , s" be the intersection 
subspaces of node 2 with nodes 3, . . . , n. Again, s' 2 and 



'3' • 



, s" form a basis for W' 2 and hence node 2 can be 



replaced by these. Continuing in the same manner across all 
the remaining nodes, it is easy to see that the code C has 
the same structure as C. ■ 

Hence our code is unique upto the choice of basis for the 
node subspaces. 

Theorem 5: A necessary and sufficient condition for any 
linear code to be (n, k, d) exact regenerating code for the 
MBR point is that any set of d + 1 nodes should have the 
same structure as our code. 

Proof: Necessity: If there exists a linear exact regener- 



ating code at the MBR point for some (n, k, d), then any set 
of d + 1 nodes from this code should work as a code for the 
parameters (d + 1, k, d). Hence, from Theorem|4] any set of 
d + 1 nodes is of the same structure as our code. 

Sufficiency: Suppose there exists a linear code such that 
any set of d + 1 nodes from this code has the same structure 
as our code. Consider a DC connecting to some k nodes. 
This set of k nodes can be viewed as a subset of some d + 
1 nodes which will have the same structure as our code. 
Hence, the DC can reconstruct the entire data. Consider a 
failed node, and some d nodes used to regenerate it. Since 
this set of d + 1 nodes will also have the same structure as 
our code, exact regeneration of the failed node is possible. 
Thus, reconstruction and exact regeneration properties are 
established. ■ 

V. Regenerating Codes for the MSR point 

The MSR point requires the least possible storage at the 
nodes (with respect to the storage-repair bandwidth tradeoff 
curve). This operating point particularly suits applications 
like storage in peer-to-peer systems where storage capacity 
available from the participating nodes is very low. In such 
systems, multiple node failures are quite frequent as nodes 
enter and exit the system at their own will. Hence the system 
should be capable of regenerating a failed node using only a 
small number of existing nodes. Also, the number of nodes 
in the system changes dynamically. Hence the code should 
work even if the number of nodes keeps varying with time. 

In this section we give an explicit construction for regener- 
ating codes at the MSR point for d = k + 1 and any n. This 
set of parameters makes the code capable of handling any 
number of failures provided that at least k + 1 nodes remain 
functional. Note that, by definition, if less than k nodes are 
functional then a part of the data will be permanently lost. 
If exactly k nodes are functional, then these nodes will have 
to pass all the information stored in them for regeneration, 
hence no optimization of the repair bandwidth is possible. 

At the minimum storage point, optimal a and /3 on the 
storage-repair bandwidth tradeoff curve are given by (from 
[4]): 



and 



(28) 



(atMSR, PmSr) = 



D 



D 



k' k(d- k + 1) 



(24) 



By the same argument as in the MBR case, we choose 
(3 = 1 for our construction, which gives 



B = k(d- k + 1) 



and 



d-k + l 



(25) 
(26) 



A. Code construction: 



With d = k + 1, from equations (25i and (26 1 we have 

B = 2k (27) 



Partition the source symbols into two sets: /o,...,/fe-i, 
and g ,...,g k -i- Let /* = (/ /i ... fk-i), and g* = 

(go gi ■■■ gk-i)- 

Node i (i = 1, . . . , n) stores (f p. , g t p i + f %) as its 
two symbols. We shall refer to the vectors p i and as the 
main vector and the auxiliary vector of a node respectively. 
The elements of the auxiliary vectors are known but can take 
any arbitrary values from F f/ . The main vectors are the ones 
which are actually used for reconstruction and regeneration. 

Let the set of main vectors p.[i = 1, ...,n) form a k- 
dimensional MDS code over ¥ q . The required field size is 
the minimum field size required to construct an [n, k] MDS 
code. If we use a Reed-Solomon code, the minimum field 
size required turns out to be just n. 

For example, consider n — 5, k — 3 and d — 4. We 
have B = 6 and fo, fx, f2, go, g\ and g^ as the source 
symbols. Let the main vectors p. (i — 1, . . . , n) form a Reed- 
Solomon code, with p. = (1 9i Off. 9i (i — 1, . . . , 5) take 
distinct values from ¥ q (q > 5). We can initialize elements 
of u^i = 1, . . . , 5) to any arbitrary values from F g . 

B. Reconstruction: 

A data collector will connect to any k nodes and download 
both the symbols stored in each of these nodes. The first 
symbols of the k nodes provide f p. at k different values of 
i. To solve for /, we have k linear equations in k unknowns. 
Since p.'s form a fc— dimensional MDS code, these equations 
are linearly independent, and can be solved easily to obtain 
the values of / , . . . , fu-i ■ 

Now, as / and are known, / can be subtracted out 
from the second symbols of each of the k nodes. This leaves 
us with the values of g t p i at k different values of i. Using 
these, values of go, . . . , gt-i can be recovered. 

Thus all B data units can be recovered by a DC which 
connects to any k nodes. We also see that reconstruction is 
possible irrespective of the values of the auxiliary vectors u i . 

C. Regeneration: 

In our construction, when a node fails, the main vector 
of the regenerated node has the same value as that of the 
failed node, although the auxiliary vector is allowed to be 
different. Suppose node j fails. The node replacing it would 
contain (f l p . , g l p . + f JL) where elements of m 7 - can take 
any arbitrary value from ¥ q and are not constrained to be 
equal to those of Uy As the reconstruction property holds 
irrespective of the values of Uj, the regenerated node along 
with the existing nodes has all the desired properties. 

For regeneration of a failed node, some d nodes give one 
(as (3 = 1) symbol each formed by a linear combination of 
the symbols stored in them. Assume that node A^+i fails 
and nodes Ai, . . . , are used to regenerate it, where the 
set {Ai, . . . , Ad+i} is some subset of {1, . . . , n}, with all 
elements distinct. 



Let a; and bi (i = 1 , . . . , d) be the coefficients of the 
linear combination for the symbol given out by node Aj. Let 
Vi = a,i(fp A ,)+b l (g t p A +fu Ai ) be this symbol. Let S t and 
Pi (i = 1 } . . . , d) be the coefficients of the linear combination 
used to generate the two symbols of the regenerated node. 
Thus the regenerated node will be 



' d d \ 

SiVi , ^2 Pivi 



(29) 



Choose bi = 1 (i — 1, . . . , d). Now choose p L (i — 1. . . . , d) 
such that 



E 



and 5j (i = 1 , . 



, d) such that 

d 



2a, 



= 



(31) 



d = k+1 unknowns each. Since p. 's form a fc— dimensional 



MDS code these can be solved easily in ¥ q . This also ensures 



that we can find a solution to equation (31 1 with all Si's 
non-zero. 

Now, choose dj (i = 1, . . . , d) such that 



5 i( a iP Al + 6 ^A.) = =A d+1 



(32) 



(=i 



node is to be minimized and where the number of nodes in 
the system varies with time. The codes given for both end 
points of the storage-repair bandwidth tradeoff have a low 
field size requirement and are of low complexity. 



[l] 



[3] 
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Equations ( 30 1 and ( 3 1 1 are sets of k linear equations in [7] 
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i.e 



5^w Ai =P Ad+i 

1=1 



5ibiU A . 



(33) 



Equation ( 33 i is a set of k linear equations in d — k + 
1 unknowns which can be easily solved in ¥ q . Since none 
of the Si (i — l,...,d) are zero, the particular choice of 
P A 's used guarantees a solution for a, (i = 1, . . . , d). Hence, 
regeneration of any node using any d other nodes is achieved. 

VI. Conclusion 

In this paper, the notion of Exact Regenerating Codes was 
introduced in which a failed node is replaced by a new node 
which is its exact replica. Optimal Exact Regenerating Codes 
meet the storage-repair bandwidth tradeoff and have several 
advantages such as the absence of communication overhead 
and a low runtime processing requirement in comparison with 
more general regenerating codes. An explicit construction of 
exact regenerating codes for the MBR point with d = n — 1 
was provided, which is well suited for applications such as 
mail servers that call for fast recovery upon failure. Subspace 
viewpoint was used to prove the uniqueness of our code. 
At the MSR point, an explicit construction for regenerating 
codes for d = k + 1 was given, that is suitable for peer-to- 
peer storage systems where the amount of data stored in each 



